Environmental data mining and modeling based on machine learning algorithms and geostatistics

نویسندگان

  • Mikhail F. Kanevski
  • Roman Parkin
  • Aleksey Pozdnukhov
  • Vadim Timonin
  • Michel Maignan
  • Vasiliy V. Demyanov
  • Stéphane Canu
چکیده

The paper presents some contemporary approaches to the spatial environmental data analysis, processing and presentation. The main topics are concentrated on the decision–oriented problems of environmental and pollution spatial data mining and modelling: valorisation and representativity of data with the help of exploratory data analysis, topological, statistical and fractal measures of monitoring networks, spatial predictions and classifications, probabilistic and risk mapping, development and application of conditional stochastic simulation models. The set of tools used consists of machine learning algorithms (MLA) – Multilayer Perceptron, General Regression Neural Networks, Probabilistic Neural Networks, Radial Basis Function Networks, Support Vector Machines and Support Vector Regression, and recently developed geostatistical predictive and simulation models. The innovative part of the report deals with integrated/hybrid models, including ML Residuals Kriging/Cokriging predictions, ML Residuals Simulated Annealing/Sequential Gaussian simulations. The objective of the integrated models is twofold: from one side ML algorithms efficiently solve problems of spatial non-stationarity, which are difficult for geostatistical approach; from another side geostatistical tools are widely and successfully applied to characterise the performance of the ML algorithms, analysing the quality and quantity of the spatially structured information extracted from data by ML. Moreover, mixture of ML data driven and geostatistical model based approaches are attractive for decision-making process.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Machine learning algorithms in air quality modeling

Modern studies in the field of environment science and engineering show that deterministic models struggle to capture the relationship between the concentration of atmospheric pollutants and their emission sources. The recent advances in statistical modeling based on machine learning approaches have emerged as solution to tackle these issues. It is a fact that, input variable type largely affec...

متن کامل

Evaluating machine learning methods and satellite images to estimate combined climatic indices

The reflections recorded on satellite images have been affected by various environmental factors. In these images, some of these factors are combined with other environmental factors that cannot be distinguished. Therefore, it seems wise to model these environmental phenomena in the form of hybrid indicators. In this regard, satellite imagery and machine learning methods can play a unique role ...

متن کامل

Accuracy Improvement of Mood Disorders Prediction using a Combination of Data Mining and Meta-Heuristic Algorithms

Introduction: Since the delay or mistake in the diagnosis of mood disorders due to the similarity of their symptoms hinders effective treatment, this study aimed to accurately diagnose mood disorders including psychosis, autism, personality disorder, bipolar, depression, and schizophrenia, through modeling and analyzing patients' data. Method: Data collected in this applied developmental resear...

متن کامل

Personal Credit Score Prediction using Data Mining Algorithms (Case Study: Bank Customers)

Knowledge and information extraction from data is an age-old concept in scientific studies. In industrial decision-making processes, the application of this concept gives rise to data-mining opportunities. Personal credit scoring is an ever-vital tool for banking systems in order to manage and minimize the inherent risks of the financial sector, thus, the design and improvement of credit scorin...

متن کامل

Application of ensemble learning techniques to model the atmospheric concentration of SO2

In view of pollution prediction modeling, the study adopts homogenous (random forest, bagging, and additive regression) and heterogeneous (voting) ensemble classifiers to predict the atmospheric concentration of Sulphur dioxide. For model validation, results were compared against widely known single base classifiers such as support vector machine, multilayer perceptron, linear regression and re...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Environmental Modelling and Software

دوره 19  شماره 

صفحات  -

تاریخ انتشار 2004